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Spinocerebellar ataxia type 3 is one of the polyglutamine (polyQ) diseases, which are caused by a CAG-repeat 
expansion within the coding region of the associated genes. The CAG repeat specifies glutamine, and the 
expanded polyQ domain mutation confers dominant toxicity on the protein. Traditionally, studies have 
focused on protein toxicity in polyQ disease mechanisms. Recent findings, however, demonstrate that the 
CAG-repeat RNA, which encodes the toxic polyQ protein, also contributes to the disease in Drosophila. To 
provide insights into the nature of the RNA toxicity, we extracted brain-enriched RNA from flies expressing 
a toxic CAG-repeat mRNA (CAG100) and a non-toxic interrupted CAA/G mRNA repeat (CAA/G105) for micro- 
array analysis. This approach identified 160 genes that are differentially expressed specifically in CAG100 
flies. Functional annotation clustering analysis revealed several broad ontologies enriched in the CAG100 
gene list, including iron ion binding and nucleotide binding. Intriguingly, transcripts for the Hsp70 genes, 
a powerful suppressor of polyQ and other human neurodegenerative diseases, were also upregulated. We 
therefore tested and showed that upregulation of heat shock protein 70 mitigates CAG-repeat RNA toxicity. 
We then assessed whether other modifiers of the pathogenic, expanded Ataxin-3 polyQ protein could also 
modify the CAG-repeat RNA toxicity. This approach identified the co-chaperone Tpr2, the transcriptional 
regulator Dpld, and the RNA-binding protein Orb2 as modifiers of both polyQ protein toxicity and CAG- 
repeat RNA-based toxicity. These findings suggest an overlap in the mechanisms of RNA and protein- 
based toxicity, providing insights into the pathogenicity of the RNA in polyQ disease. 



INTRODUCTION 

A large number of neurodegenerative diseases are caused by 
unstable trinucleotide repeat expansions in the associated 
genes (1). Based on pathogenic mechanisms, these disorders 
are categorized into different groups, including loss of 
protein function, RNA toxicity and dominant protein-based 
toxicity. Myotonic dystrophy type 1 (DM1) and myotonic dys- 
trophy type 2 (DM2) are thought to be caused primarily by a 
gain-of-function RNA toxicity. CUG and CCUG RNA expan- 
sions in the non-coding regions of the dystrophia myotonica- 
protein kinase (DMPK) gene and the zinc finger 9 (ZNF9) 
gene cause DM1 and DM2, respectively (2). Polyglutamine 
(polyQ) diseases are dominantly inherited disorders expanded 
CAG repeats within the coding region of the associated genes, 
such as Huntington's disease (HD) and the spinocerebellar 



ataxias (SCA1, 2, 3, 6, 7 and 17) (3). A pathological hallmark 
of these diseases is ubiquitinated nuclear inclusions (NIs) con- 
taining chaperones and subunits of the proteasome (3). 

The prevailing hypothesis for polyQ disease mechanisms is a 
protein effect due to the polyQ. Upregulating chaperones sup- 
presses toxicity and supports the hypothesis that protein mis- 
folding or aberrant interactions of the mutant protein cause 
disease (4). Therefore, studies have traditionally focused on 
protein-based mechanisms of toxicity conferred by the 
expanded polyQ domain. Recent studies, however, suggest 
that RNA toxicity is a component of degeneration (5). The 
toxicity of a pathogenic, expanded polyQ Ataxin-3 protein in 
Drosophila is significantly reduced by interrupting the typically 
pure CAG-repeat sequence encoding the polyQ domain, with a 
CAA-interrupted repeat which also encodes glutamine. 
CAG-repeat RNA forms a hairpin structure which is thought 
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to confer toxicity (6,7). Interruptions of the C AG-repeat RNA by 
CAA codons are predicted to disrupt the hairpin structure (8). 
Furthermore, evidence suggests that pure or interrupted 
CAG-repeat expansions in ATAXIN-2 (ATXN2) are associated 
with distinct effects (9-12). Expanded pure CAG repeat in 
ATXN2 typifies SCA2. CAA interruptions in ATXN2 are found 
in CAG-repeat expansions that present with parkinsonism or 
amyotrophic lateral sclerosis (ALS). Taken together, these find- 
ings suggest a role of the CAG-repeat RNA in toxicity and 
effects. Additional studies indicate that simply a non-coding 
RNA bearing only an abnormally long CAG repeat but not 
coding for a protein can induce progressive neuronal dysfunc- 
tion, including shortened lifespan and locomotor defects (5). 
Interestingly, muscleblind (mbl), an RNA-binding protein 
implicated as a modifier in DM1, can dramatically enhance 
the toxicity of both the Ataxin-3 protein and non-coding 
CAG-repeat RNA in Drosophila (13,14). These findings 
suggest that the CAG-repeat RNA, in the absence of coding 
for a protein, may be a component of toxicity in polyQ 
disease. However, mechanisms of CAG-repeat RNA toxicity 
remain unclear. 

To further understand mechanisms of CAG-repeat RNA 
toxicity, we assessed transcriptional changes in flies expres- 
sing toxic expanded CAG-repeat RNA using microarray ana- 
lysis in Drosophila. This approach identified transcriptional 
changes in 160 genes specifically in the brain of flies expres- 
sing toxic CAG repeats, suggesting a robust component of 
transcriptional alterations. These findings led us to test the 
overlap between modifiers defined for protein-based polyQ 
toxicity and the toxicity of the non-coding RNA. Our findings 
highlight an overlap between the two situations, suggesting a 
component of shared mechanism. 

RESULTS 

Defining gene expression changes in response to a toxic 
non-coding CAG-repeat RNA 

To identify gene pathways involved in response to a toxic non- 
coding CAG-repeat RNA, we performed microarray analysis 
on Drosophila brains using the Affymetrix Drosophila 
Genome 2.0 Array. In order to assess early transcriptional 
changes and to avoid detecting secondary responses, pre- 
symptomatic adult flies expressing a CAG repeat in the 3' un- 
translated region (3'-UTR) of a transgene encoding the control 
protein DsRed were used (Fig. 1A). The control CAGO flies 
express a DsRed reporter gene with no CAG repeat in the 
3'-UTR. The CAG100 and CAG250 flies have a CAG repeat 
of about 100 and 250 CAG trinucleotides in the 3'-UTR, as 
previously described (5). The CAA/G105 flies are a non-toxic 
trinucleotide control and express DsRed with an interrupted 
CAA/CAG-repeat sequence in the 3'-UTR. The experimental 
design is illustrated in Figure IB. To define genes whose 
expression is altered in response to a toxic CAG-repeat 
RNA, we compared CAG 100 flies with age-matched flies 
expressing CAGO. To exclude transcriptional changes in 
response to a non-toxic trinucleotide repeat, a second gene 
list was generated by comparing CAA/G105 flies with age- 
matched CAGO flies. To then focus on the list of genes specific 
to the toxic CAG-repeat RNA, we eliminated genes that were 



differentially expressed in common to both CAG 100 and 
CAA/G105 flies. At 3 days, flies expressing CAG100 in all 
neurons of the brain with e/av-GAL4 driver show normal 
climbing ability and normal survival compared with age- 
matched control flies expressing CAGO (5). Fly brains were 
thus dissected from 3-day animals, and RNA was isolated 
for microarray analysis (Fig. 1C). To assess whether the 
brain dissection enriched for brain RNAs, the dissection 
quality was assessed by determining the relative level of ex- 
pression of an eye gene, chaoptin (chp), which is selectively 
expressed in photoreceptor cells, and the neural gene marker 
amyloid precursor protein like {appl), which is selectively 
expressed in neurons thus enriched in brain compared with 
eye tissue (15,16). Quantitative real-time polymerase chain re- 
action (qRT-PCR) analysis indicated that the relative expres- 
sion level of chp was significantly low (less than 10-fold 
compared to head RNA) in all samples, whereas the level of 
appl was enriched more than 12-fold compared with head 
RNA (Fig. ID and E). This analysis thus indicated that dissec- 
tion of the brain from the eyes successfully enriched for RNA 
expressed in the brain among the four independent replicates 
of the microarray analysis. 

The expression levels of each trinucleotide repeat transgene 
were also controlled. Transcript levels were determined by 
assessing the level of the DsRed portion of the transgenes 
by qRT-PCR. The transgenes were adjusted to match at the 
mRNA expression level, such that CAGO flies had one copy 
of the transgene and the CAG100 flies had five copies. The 
CAA/G105 line expressed the transgene at levels lower than 
those of the CAG 100. Thus, we used two copies of this trans- 
gene in order to express this RNA at a level at least as high as 
that of the CAG100 flies (Fig. IF). 

Global analysis of genes differentially expressed in flies 
expressing a toxic CAG RNA 

Genes differentially expressed in CAG100 and/or CAA/G105 
were defined by comparing RNA from brains of flies expres- 
sing CAG 100 versus CAGO and CAA/G105 versus CAGO. 
Pearson's correlation coefficient showed a strong correlation 
between the biological replicates of any one transgene 
(r > 0.99). We predicted that the transcriptional profile of 
CAA/G105 flies would be more similar to CAGO flies than 
to CAG 100 flies, because the interrupted CAA/G105 repeat 
is not toxic. Hierarchical clustering confirmed a higher simi- 
larity between the transcriptional profiles of CAGO- and 
CAA/G105-expressing flies than between CAGO- and 
CAGlOO-expressing flies (Fig. 2A). In addition, it further sup- 
ported our experimental design to exclude genes differentially 
expressed in response to a long trinucleotide repeat by using 
the interrupted CAA/G105 RNA. 

About 129 transcripts were identified that were 2-fold or 
greater upregulated and 81 that were 2-fold or greater 
downregulated in CAG 100 flies compared with CAGO flies 
(Fig. 2B). Among these, 19% (25 transcripts) upregulated 
and 31% (25 transcripts) downregulated transcripts were also 
differentially up- or downregulated in CAA/G105 flies (Sup- 
plementary Material, Table SI a). These transcripts were not 
pursued at this time; these transcripts might reflect a general 
response to expressing a long trinucleotide repeat RNA, 
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Figure 1. Microarray analysis with brain-enriched RNA. (A) DNA constructs with untranslated CAG or CAA/G repeats placed in the 3'-UTR of a control protein 
DsRed. The control CAGO flies express the control DsRed reporter with no CAG repeat in the 3'-UTR. The CAGIOO and CAG250 flies express the DsRed 
reporter with repeats of about 100 and 250 CAGs in the 3'-UTR (5). CAA/G105 flies, a non-toxic trinucleotide control, express the DsRed reporter with a tri- 
nucleotide sequence of CAA and CAG of 105 units (5). (B) Venn diagram illustrates the rationale to generate lists of genes differentially expressed in CAGIOO 
animals selectively. (C) Illustration of brain dissection process, to enrich for RNA expressed in the brain tissue. The dissection removes the eyes and the outer 
retinal lamina from the head capsule. (D and E) To verify the brain dissection quality, we quantified relative expression levels of (D) chaoptin (chp) and (E) appl 
by qRT-PCR. (D) The gene chp is expressed in the retina. Quantifying its expression in brain, compared with head in all the preparations, indicated a lack of eye 
tissue in the brain preparations. (D) The gene appl is selectively expressed in neurons and showed an enrichment in the brain preparations. (F) The expression 
levels of each trinucleotide repeat transgene were also controlled. Transcript levels were determined by assessing the level of the DsRed portion of the transgenes 
by qRT-PCR. Expression levels in (D-F) are relative to CAGO (brain CAGO was set to 1) and are presented as mean + SD (n = 4 biological replicates used in 
the microarray analysis). Genotypes of flies from left to right: e/av-GAL4 in trans to (CAGO) UAS-DsRed-CAGO, (CAGIOO) UAS-DsRed-CAGlOO (5x), (CAA/ 
G105) is UAS-DsRed-CAA/G105 (2x). 



whether toxic or not. About 160 transcripts (104 up and 56 
down) were altered selectively in CAGIOO flies (Supplemen- 
tary Material, Table Sib). About 122 transcripts (48 up and 
74 down) were altered in CAA/G 105 flies, although not in 
CAGIOO flies (Supplementary Material, Table Sic). To 
verify the microarray analysis, we selected 12 genes of a 
range of gene expression alterations to assess for expression 
changes by qRT-PCR in CAGIOO flies (Supplementary Mater- 
ial, Table S2). Five out of 12 genes expression changes, or 
42%, were positively validated in CAGIOO (P < 0.05). 
Genes for which the expression level was not changed in a sig- 
nificant manner still showed a trend consistent with the micro- 
array analysis. Eleven out of 12 genes expression changes, or 
92%, were positively correlated with the change detected in 
the microarray. 

Gene Ontology (GO) analysis based on the Database of 
Annotation, Visualization and Integrated Discovery was 
performed to provide insights into the biological processes 



affected by the genes differentially expressed selectively in 
CAGIOO flies (Supplementary Material, Table Sib) (17,18). 
We used the GO_FAT category which filters out very broad 
GO terms to identify statistically enriched functional groups. 
Several broad GO terms were enriched, including iron ion 
binding, nucleotide binding and oxidation reduction (Fig. 3). 
Genes related to ion channels and receptors were also affected 
in response to CAGIOO. Trpgamma of the transient receptor 
potential (TRP) classic channel family and CG34123 of the 
TRP melastatin-like channel family were upregulated in 
CAGIOO flies (2.4-fold change). TRP channels are essential 
for the control of extracellular magnesium levels (19). 

RNA-binding proteins are especially interesting targets for 
potential mechanisms of CAG-repeat RNA toxicity. The tran- 
scripts of two RNA-binding protein genes were downregulated 
in CAGIOO flies: RNA-binding protein 4 (-2.4-fold change), 
which has an RNA recognition motif (RRM), and CHKovl 
( — 2.3-fold change), predicted to have an RNA-binding motif 
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Figure 2. Transcriptional profiles of CAG100 and CAA/G105 flies. (A) A 
total of four independent biological replicates were examined by microarray 
analysis. Hierarchical clustering of all elements on the array by Euclidean dis- 
tance showed a higher similarity between transcriptional profiles of the same 
genotypes, regardless of the replicate. Hierarchical clustering also showed a 
higher similarity between the transcriptional profiles of CAGO and CAA/ 
G105 than between CAGO and CAG100. (B) Venn diagram highlighting the 
number of differentially expressed genes that have at least a 2-fold change 
in CAG100- and/or CAA/G105-expressing flies (step-up P < 0.05). 



and a domain of RNA-directed DNA polymerase activity. In the 
list of upregulated transcripts, expression of the RNA-binding 
protein eIF4E-5 was upregulated 4.9-fold in CAG100 flies. 

The cytochrome P450 superfamily (Cyp) is a large and diverse 
group of enzymes that catalyse oxidation. Of these, four cyto- 
chrome P450-related genes were downregulated and four were 
upregulated. Their differential expression might reflect a detoxifi- 
cation response in CAG100 flies at the pre-symptomatic stage. 
Significant upregulation of cytochrome P450-related genes has 
also been observed in the microarray analysis of a Drosophila 
model of Parkinson's disease (PD) (20). 



Hsp70, a genetic modifier of polyQ toxicity, is upregulated 
in microarray analysis 

Intriguingly, in the microarray analysis, we found that some of 
the Hsp70 genes were significantly upregulated in brains of 
animals expressing the toxic CAG100 RNA (Supplementary 
Material, Table Sib). Heat shock protein 70 (Hsp70) is a 
powerful modulator of the pathogenic polyQ protein, such 
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Figure 3. Analysis of GO categories for genes differentially expressed in 
CAG100. (A) Bar graph shows the GO terms and the number of genes in 
each category that are significantly enriched in the CAG100 gene list. The 
enrichment score (E) is the P-value in the -log scale. The P-value shows 
whether the gene list is specifically enriched in the corresponding annotation 
cluster compared with random chance. The enrichment score is > 1.3 (dashed 
line) when the P-value is <0.05. (B) Lists of genes in each enriched category 
in (A). 



that in Drosophila models for SCA3, Hsp70 becomes loca- 
lized to the NIs (4). Furthermore, upregulation of Hsp70 sup- 
presses polyQ-mediated neurodegeneration. There are 
multiple copies of the Hsp70 genes in Drosophila at distinct 
loci, including HsplOAa and Hsp70Ab at chromosomal pos- 
ition 87A and Hsp70Ba, Hsp70Bb, Hsp70Bbb and Hsp70Bc 
at 87B. We found that probe sets of transcripts for Hsp70Ba, 
Hsp70Bb and Hsp70Bbb genes were upregulated 4.06-fold. 

Given this upregulation of Hsp70 transcripts, we used a 
second technique to determine whether abnormal localization 
or levels of Hsp70 protein were detectable in flies expressing 
CAG100. Tissue immunostaining of fly brains using an anti- 
body that detects the Hsc70/Hsp70 protein family was per- 
formed on flies expressing the toxic RNA repeat (Fig. 4A). 
At 3 days, when an Hsp70 transcriptional change was detect- 
able by the microarray analysis, we did not see an alteration of 
the Hsc70/Hsp70 protein in CAG100 flies compared with con- 
trols (Fig. 4C and D). By 28 days, the brain normally shows a 
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Figure 4. Hsc70/Hsp70 accumulated in aged CAGIOO flies. (A) Cryosection of a CAGIOO fly head at 28 days, immunostained for Hsc70/Hsp70. The dashed 
outline represents the area enlarged in (C-F). (B) Graph of the mean number of Hsc70/Hsc70 particles in CAGO- and CAGlOO-expressing animals at 28 days. 
The average number of Hsc70/Hsc70-positive particles within the area shown in (E') and (F') was quantified. Error bars represent SD (*P < 0.05, Student's /-test; 
n = 3-4). (C and D) At 3 days, Hsc70/Hsp70 accumulation is not detectable in CAGO and CAGIOO fly brains. Enlarged images of the dashed outline area are 
shown in (C) and (D') (green: anti-Hsc70/Hsp70, blue: DAPI). (E and F) At 28 days, there are prominent Hsc70/Hsp70 accumulations in CAGIOO flies. Enlarged 
images of the dashed outline area are shown in (E') and (F'). Genotypes of flies: e/av-GAL4 in trans to (CAGO) UAS-DsRed-CAGO and (CAGIOO) 
UAS-DsRed-CAGlOO (5x). DAPI images are shown next to (C— F') for orientation. 



modest degree of Hsc70/Hsp70 accumulations (Fig. 4F). Strik- 
ingly, immunostaining of CAGlOO-expressing animals 
revealed abundant Hsc70/Hsp70 accumulations in the cortical 
region of the brain at 28 days (Fig. 4E). Quantification of the 
accumulations (Fig. 4E' and F') showed that the number of 
Hsc70/Hsp70 accumulations was significantly increased 
(P < 0.05) in CAGlOO-expressing flies compared with 
control animals (Fig. 4B). Thus, the microarray analysis 
revealed a transcriptional increase in Hsp70 genes in 
CAGIOO flies in 3-day pre-symptomatic animals and a 
protein translational response in Hsc70/Hsp70 in 28-day 
animals. Furthermore, these data suggested an overlap in the 
biological response of the animal to the toxic expanded 
polyQ protein and toxic expanded CAG-repeat RNA, 
namely an Hsc70/Hsp70 response. 

Hsp70 is a modifier of Ataxin-3 protein-based 
neurodegeneration and of CAG-repeat RNA-based toxicity 

To determine whether upregulation of Hsp70 could modify the 
toxicity of the CAG-repeat RNA, we co-expressed human 



Hsp70 together with a non-coding CAG250 RNA ubiquitously 
in the animal with daughterless-GAL4 (da-GAL4). CAG250 
flies express a sequence of 250 CAG trinucleotide repeats in 
the 3'-UTR of the DsRed reporter. Flies expressing CAG250 
showed a stronger toxicity than animals with tissue-specific 
expression of the RNA, thus facilitating the analysis of modi- 
fier genes. When expressed in this manner, the CAG250 repeat 
caused compromised climbing ability at day 1 that progres- 
sively deteriorated over 14 days (Fig. 5A). Upregulation of 
Hsp70 mitigated the toxic effects of the CAG250 repeat, 
such that now 61 + 5% of the animals could climb at day 1, 
compared with 38 + 7% in the absence of added Hsp70 
(Fig. 5B). The effect of Hsp70 persisted with time such that 
by 14 days, whereas normally only 12 + 7% of CAG250 
flies could climb, with co-expressed Hsp70, now 27 + 5% of 
CAG250 flies could climb. We confirmed that Hsp70 had no 
effect on the level of expression of the CAG250 repeat 
(Fig. 5C); rather, added Hsp70 mitigated the toxic effects of 
the CAG250 RNA. These data indicate that Hsp70 is a modu- 
lator of both pathogenic polyQ protein-based toxicity and tox- 
icity conferred by a non-coding expanded CAG-repeat RNA. 
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Figure 5. Hsp70 rescues climbing defects of CAG250 flies. (A) Expressing 
GFP or Hsp70 alone does not cause climbing defects. In contrast, expressing 
CAG250 causes climbing defects in fly, and this defect progress with age. 
Genotypes of flies: da-GALA in trans to (GFP) UAS-GFP, (Hsp70) 
UAS-Hsp70 and (GFP+CAG250) is UAS-GFP UAS-DsRed-CAG250. (B) 
Co-expression of Hsp70 with CAG250 suppresses CAG250 toxicity and 
rescues climbing deficits. The climbing ability of CAG250 flies was examined 
at days 1, 7 and 14. CAG2507Hsp70 flies exhibit a significant improvement in 
climbing ability compared with age-matched CAG250 flies (mean + SD, n = 
3, Student's Mest, *P < 0.05). Genotypes of flies: da-GAlA in trans to 
(CAG250+GFP) UAS-DsRed-CAG250 UAS-GFP and (CAG250+Hsp70) is 
UAS-DsRed-CAG250; UAS-Hsp70. (C) Levels of CAG-repeat RNA were 
examined by qRT-PCR and were not significantly altered when co-expressed 
with Hsp70 (3-day flies, Student's f-test, P > 0.05). Genotypes of flies from 
left to right: da-GAL4 in trans to UAS-GFP UAS-DsRed-CAG250 and 
UAS-DsRed-CAG250; UAS-Hsp70. 

Identification of Hsp70 as a modifier of the CAG-repeat RNA- 
based toxicity raised the possibility that other modifiers of the 
pathogenic polyQ protein-based toxicity may also affect CAG 
RNA-based toxicity. 



Additional modifiers of CAG-repeat RNA toxicity 
identified from a targeted modification screen 

A previous genetic screen had identified a number of modifiers 
of Ataxin-3 protein-associated neurodegeneration (21). We 
therefore performed a targeted modification screen of these 



enhancer-promoter (EP)-element insertion lines for ability 
to modulate the CAG-repeat RNA toxicity. To assess 
effects, we examined several indicators of RNA-based toxicity 
when a CAG250 repeat is expressed in the animal: wing 
posture and motility deficits. Normally, flies hold their wings 
flat to the back, rarely showing an abnormal upward wing 
posture (Fig. 6A). However, 70% of CAG250 flies displayed 
an abnormal wing posture, holding one or both wings up 
(Fig. 6A and B). This abnormality may be due to muscle- 
and/or neural-based effects. Previous findings also show that 
flies display defects in climbing ability when expressing the 
toxic CAG-repeat RNA in neurons with e/av-GAL4 (5) 
(Fig. 5A). We used these two indicators of the toxicity of 
the CAG250 RNA (held-up wings and loss of climbing 
ability) to assess whether additional previously defined modi- 
fiers of Ataxin-3 protein-based toxicity could affect RNA tox- 
icity. 

Twenty-five different modifiers were previously isolated in 
a genome-wide screen for suppressors or enhancers of patho- 
genic polyQ Ataxin-3 protein toxicity (21). Of these, three 
mitigated both the wing posture and locomotor defects of 
CAG250 flies: the co-chaperone Tpr2 EB7 ~ lA , the lin-A\ homo- 
logue dpld EP546 and RNA-binding protein orb2 B8 - s (Fig. 6D-I). 
While only 26 ± 4% of CAG250 flies had normal wing 
posture, now 66 + 10% of flies with upregulated Tpr2 
showed normal wing position (Fig. 6D). Tpr2 EB also func- 
tionally rescued the climbing defects of CAG250 flies (58 + 8% 
of Tpr2 EB7 ' 1A flies could climb, whereas only 36 + 5% of 
CAG250 flies could climb in a similar manner in parallel 
control experiments) (Fig. 6E). Upregulation of dpld with 
the dplcf 1,546 allele rescued both wing posture and locomotor 
dysfunction, such that 58 + 9% of CAG250 flies with upregu- 
lated dplcF 1 " 546 had normal wing posture, and climbing was 
restored such that 62 + 9% of animals could climb (Fig. 6F 
and G). Upregulation of orb2 with the orb2 B8 ~ s allele resulted 
in 80 + 14% of CAG250 flies now showing normal wing 
posture and 62 + 8% of animals climbing normally (Fig. 6H 
and I). By qRT-PCR, we confirmed that these modifiers did 
not affect the expression levels of the CAG-repeat RNA 
(Fig. 6C). Taken together, these data indicated that these 
modifiers, originally identified as genes that when upregulated 
can mitigate Ataxin-3 pathogenic protein-based toxicity, also 
rescue CAG-repeat RNA-based toxicity. 

The genetic screen for modifiers of Ataxin-3 protein toxicity 
was performed with a transgene bearing a pure CAG-repeat 
RNA. As these four modifiers could modulate both the toxic 
effect of this transgene and the toxic effect of a non-coding 
CAG-repeat RNA, we were interested in whether the mitiga- 
tion of the Ataxin-3 toxicity was due to a selective effect on 
the RNA versus an effect on both the protein and the RNA. 
To address this, we determined whether interrupting the 
RNA sequence of the Ataxin-3 pathogenic protein affected 
the interaction with these modulators. If these modifiers 
were no longer effective, then this would suggest that these 
modifiers were directed solely to the CAG-repeat RNA. We 
therefore co-expressed Hsp70, Tpr2 EB7 ' JA , dplcf 546 and 
orb2 B8 ~ s , with an Ataxin-3-expressing transgene in which the 
polyQ domain is encoded by a CAA/G-interrupted repeat se- 
quence. The SCA3trQ78 C AA/G transgene encodes a toxic 
polyQ protein identical in sequence to the CAG-repeat 
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Figure 6. Tpr2 EB7 ' ,A , dplif 546 and orb2 B ' ss suppress CAG-repeat RNA toxicity. (A) Abnormal wing posture (one or both wings held-up, lower panels) is 
observed in flies expressing CAG250 with a ubiquitous driver t/a-GAL4. (B) About 30% of animals expressing CAG250 have abnormal wing posture at day 
1. Genotypes of flies from left to right: rfa-GAL4 in trans to UAS-GFP and UAS-GFP UAS-DsRed-CAG250. (C) Co-expression of Tpr2 EB? -' A , dpltf 546 and 
orb2 B ~ ss , with CAG250 does not alter the CAG250 relative expression level examined by qRT-PCR (mean ± SD, n = 3, Student's Mest, P > 0.05). (D, F 
and H) Upregulation of Tpr2 EB7 ~ 1A , dpltf 546 and orb2 B ' ss significantly rescued abnormal wing posture of CAG250 flies at day 1. (E, G and I) Co-expression 
of Tpr2 EB , dplcf 546 and orb^ significantly rescue climbing defects of CAG250 flies at day 1 (mean + SD, n = 3 independent experiments, Student's Mest, 
*P<0.05). Genotypes of flies: rfa-GAL4 in trans to (CAG250+GFP) UAS-DsRed-CAG250 UAS-GFP, (CAG250+ Tpr2 Esf ' IA ) UAS-DsRed-CAG250 
Tpr2 EB? -' A , {CKG250+dpld E546 ) UAS-DsRed-CAG250 dplcf 546 and (CAG250+orZ>2 fl -^) UAS-DsRed-CAG250; orb2 B -" s . 



encoding transgene SCA3trQ78 C AG, differing only in whether 
the RNA is a pure CAG or interrupted by CAA codons. 
Expression of SCA3tr Q78 C aa/g in the eye by gmr-GAL4 
gave a disrupted eye effect (Fig. 7A), and co-expression of 
these modifiers mitigated the toxicity (Fig. 7B-E). These 
data indicate that these modifiers can modulate both 
Ataxin-3 pathogenic protein-based toxicity and CAG-repeat 



RNA-based toxicity, highlighting the overlap between RNA- 
based and polyQ protein-based toxicity mechanisms. 

DISCUSSION 

PolyQ diseases have been considered toxic polyQ-based disor- 
ders, although recent data suggest that the expanded CAG 



Human Molecular Genetics, 2011, Vol. 20, No. 24 4817 



SCA3trQ78cAA/G 



+ Hsp70 Tpr2 EB71A orb2 B8S dpld E546 

A B i c ii m 

Figure 7. Hsp70, Tpr2 EB7 ' ,A , dplcf 546 and orb! 6 '* 18 suppress both CAG-repeat RNA-based toxicity and polyQ protein toxicity. (A) Eyes of flies expressing 
SCA3trQ78 CAA/G , the toxic polyQ protein expressed from a transgene with an interrupted CAA/G polyQ repeat region, have degenerate eyes with reduced 
pigmentation. (B-E) Upregulation of Hsp70, Tpr2 EB7 ~ jA , dplcf 546 and orb2 B ' ss ameliorates the degenerative effect of SCA3trQ78 C AA/o- Genotypes of flies 
(A-E): gmr-GAL4 SCA3trQ78 CAA/a in trans to +, UAS-Hsp70, Tpr2 EB? -' A , dplcf 546 and orbl 3 ^. 



trinucleotide repeat RNA encoding the polyQ domain may 
contribute to the disease (5). Given that mechanisms of 
RNA toxicity remain unclear, we provided insights into the 
nature of the CAG-repeat RNA toxicity by performing a 
microarray analysis to define the global transcriptional 
profile of the fly brain upon expression of toxic trinucleotide 
repeat RNA. These studies revealed that 160 genes were 
altered, including the Hsp70 genes. A targeted genetic 
screen then defined Hsp70, as well as a number of other 
genes, as modifiers of both the polyQ protein and toxic 
CAG-repeat RNA. These data indicate an overlap between 
mechanisms of the protein-based and RNA-based toxicities 
that may contribute to polyQ disease. 

Microarray analysis identifies altered Hsp70: a modifier of 
CAG-repeat RNA-based toxicity 

Our microarray analysis identified Hsp70 as upregulated in 
CAG100 flies. To confirm these findings, we performed immu- 
nostaining analysis for the Hsc70/Hsp70 protein in brains of 
animals expressing the toxic RNA. This revealed age-dependent 
accumulations. These data indicate that upregulated Hsp70 
induced by the toxic CAG repeat occurs at both transcriptional 
and translational levels. Given that Hsc70/Hsp70 accumulations 
are induced by expressing CAG 100, and Hsp70 is a powerful 
suppressor of several human neurodegenerative diseases, we 
then performed functional analysis. These findings showed 
that upregulation of Hsp70 rescues the locomotor dysfunction 
caused by the toxic CAG repeat. Similarly, in a Drosophila 
model of fragile X syndrome, which is also an RNA-mediated 
neurodegenerative situation, upregulation of Hsc70 modifies 
the CGG-repeat RNA-induced effects (22). Taken together, 
these data suggest that Hsc70/Hsp70 chaperones can modify 
toxicity conferred by both pathogenic RNA and proteins. 
Hsp70 is thought to prevent aggregation or to assist in the refold- 
ing of misfolded protein. Possible mechanisms of suppression 
by Hsp70 in RNA-mediated toxicity might involve a misfolded 
protein response triggered by deleterious consequence of the 
pathogenic repeat RNA. 

Other ways in which Hsp70 may suppress include modula- 
tion of death pathways (23). Notably, several genes associated 
with the regulation of apoptosis were in the list of genes 
affected in the CAG 100 flies. Upregulation of cytochrome c 
transcript is a general early apoptotic response, and studies 
show that Hsp70 suppresses apoptosis by acting downstream 



of cytochrome c release in mammalian cells (24,25). Intri- 
guingly, cytochrome c distal (Cyt-c-d), one of the Drosophila 
cytochrome c genes, was upregulated pre-symptomatically in 
CAG100 flies in the microarray analysis (Cyt-c-d, 2.0-fold 
change). Moreover, upregulation of Hsp70 suppresses the 
symptomatic effects of toxicity of CAG250 flies. Potentially, 
Hsp70 may suppress RNA-based toxicity through modulation 
of apoptosis processes as an additional mechanism. 

The molecular chaperone Hsp70 responds to several neurode- 
generative disease situations in animal models. In the 
Drosophila model of SCA3, Hsp70 is localized to the nuclear 
accumulations, whereas normally there is a low level of 
Hsp70 in the cytoplasm (4). Upregulation of Hsp70 suppresses 
the toxicity caused by the pathogenic Ataxin-3 protein. In 
mouse models of spinal bulbar muscular atrophy (SBMA), 
due to polyQ expansions in the androgen receptor gene, 
Hsp70 levels are increased in the muscle, and upregulation of 
Hsp70 ameliorates pathogenicity of the SBMA model (26). In 
the Drosophila model of fragile X-associated tremor/ataxia 
syndrome, Hsp70-containing inclusions are induced by expres- 
sing long CGG repeats in the 5'-UTR of an enhanced green 
fluorescent protein (EGFP) reporter gene. Upregulation of 
Hsc70 modifies CGG-repeat-induced degeneration (22). RnrL, 
which suppresses non-coding CTG1 12 RNA-induced neurode- 
generation in a Drosophila model for SCA8, was also identified 
in our present study (RnrL, 7.4-fold change) (27). These data 
suggest that candidate genetic modifiers can be identified 
through microarray analysis, defining altered genes in the 
disease-associated condition. 

A recent study shows that modifier of mdg4 [mod(mdg4)] 
transcript levels is altered in Drosophila heads upon expres- 
sion of CAG-, CUG- and AUUCU-repeat RNA and that 
mod(mdg4) is an enhancer of CAG- and CUG-repeat toxicity 
(28). We did not find that mod(mdg4) met our criteria in gen- 
erating gene lists; however, we used only brain tissue and not 
heads with eye tissue included. Transcriptional profiles may 
differ in distinct cell types in response to toxic elements, 
and brain-enriched RNA may reveal distinct or more subtle 
changes in gene expression that are selective to neurons. 

Candidate targets for further investigation of RNA-based 
toxicity mechanisms 

Several functional classes of genes were differentially 
expressed in CAG100 flies (Fig. 3). These classes give insights 
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into molecular pathways that respond to the toxic CAG-repeat 
RNA. For example, microglial activation and innate immune 
system activation are recognized as a disease feature in Alz- 
heimer's (AD), PD, HD and ALS. Anti-inflammatory treat- 
ments may reduce susceptibility to AD and PD (29). 
Proteins involved in the innate immune response could be 
used as molecular biomarkers to identify pre-symptomatic 
stage of AD (30). Beyond changes in the cytochrome P450 
superfamily, there were three upregulated transcripts asso- 
ciated with innate immune response, including attacin, 
Immune-induced molecule 2 and metchnikowin. These gene 
lists not only provide candidate targets for further genetic ana- 
lysis, but might also provide insights into potential 
CAG-repeat RNA pathogenesis and the common characteristic 
of neurodegenerative diseases. Furthermore, a set of genes was 
selectively affected in CAA/G105 and not in CAG100 or 
CAGO (Supplementary Material, Table Sic). These transcripts 
may be relevant to situations in which the polyQ repeat se- 
quence in the disease gene is interrupted, as in ATXN2 
polyQ expansions that present with parkinsonism or ALS 
rather than SCA2. Future studies may reveal insights with 
regard to potential pathways modulated. 

Expanded CAG-repeat RNA and pathogenic polyQ protein 
trigger common pathways 

Beyond Hsp70, three genetic modifiers identified in the 
present study, Tpr2 EB7 -' A , dpld EF546 and orb2 B8 ~ s , also sup- 
press both CAG-repeat RNA-based toxicity and polyQ 
protein-based toxicity. Previous studies show that these modi- 
fiers do not alter the Hsp70 expression level, thus function in a 
distinct manner (21). dTPR2 is homologous to the human tet- 
ratricopeptide repeat protein 2. Tpr2 is a co-chaperone con- 
taining two chaperone-binding TPR domains and a DnaJ 
homologous J domain (31,32). In previous studies, Hsp70 
and five other modifiers of Ataxin-3 protein-associated neuro- 
degeneration were categorized in the chaperone class (21). 
Surprisingly, two of these six also suppressed CAG-repeat 
RNA toxicity (Hsp70 and Tpr2 EB7 ' 1A ) (Fig. 8). This raises a 
role for chaperones and ubiquitin-associated pathway compo- 
nents in CAG-repeat toxicity, although likely specific genes 
involved reflect specific processes primarily detected. 

dpld 54 was previously identified as a strong suppressor of 
Ataxin-3 toxicity and could suppress tau toxicity as well as a 
cell-death gene (hid). Intriguingly, data indicate that the 
mouse homologue of dpld (lin-41) is localized to P-granules, 
which are sites of RNA processing and degradation (33). 
This overlap between dpld as a modifier of polyQ protein tox- 
icity and the CAG-RNA-based toxicity, coupled with the 
implication of P-granule involvement, supports the idea of 
RNA pathways going awry in both situations. 

Drosophila orb2 is also known as Drosophila cytoplasmic 
polyadenylation element-binding protein (CPEB). CPEB 
belongs to a family of RNA-binding proteins which have two 
RRMs and a Zn-finger domain (34). CPEBs regulate translation 
of target mRNAs (35) and function in many processes, including 
synaptic plasticity, learning and memory (36-38). Recent 
studies show that Drosophila Orb2 binds directly to transcripts 
associated with long-term memory, and Orb2 targets are asso- 
ciated with neuronal growth, synapse formation and protein 
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Figure 8. Modifiers of Ataxin-3 protein-associated neurodegeneration affect 
both polyQ protein toxicity and CAG-repeat RNA-based toxicity. A previous 
genetic screen identified a number of modifiers of Ataxin-3 protein-associated 
neurodegeneration. We therefore performed a targeted screen of these modu- 
lators for ability to affect the CAG-repeat RNA toxicity. Hsp70 and another 
1 8 genes were identified as suppressors of Ataxin-3 protein-based neurodegen- 
eration. Among these 1 9 modulators, Hsp70, Tprf :B? -' A , dplcF 546 and orb2 BS - s 
were found in the present study to also affect the CAG-repeat RNA-based tox- 
icity. Previous studies also demonstrated that mbl affects the toxicity of both 
the Ataxin-3 protein and non-coding CAG-repeat RNA. These genes that can 
modulate both polyQ protein and CAG-repeat RNA-based toxicities highlight 
an overlap between RNA-based and polyQ protein based-toxicity mechanisms. 



turnover (39). Taken together, these studies indicate that Orb2 
is broadly involved in various neuronal functions. Moreover, 
Orb2 is co-localized in dFMRl -containing (fragile X mental re- 
tardation 1) foci in cultured Drosophila primary neurons (40). 
Fragile X mental retardation protein has been implicated in 
the translational regulation of target mRNAs in neurons and is 
the protein affected in fragile X syndrome (41-43). The 
pathogenic role of neuronal RNA granules in neurological 
disease suggests the possibility that CAG-repeat RNA might 
interfere with the normal function of Orb2 in neuronal RNA 
granules to cause toxicity. Homologues of mbl, which modifies 
Ataxin-3 protein toxicity, CAG-repeat RNA toxicity and 
CUG-repeat RNA toxicity, co-localize with RNA stress gran- 
ules in cell culture (44,45). Thus, mbl may be broadly involved 
in other aspects of RNA metabolism that contribute to neurode- 
generative disease. Future investigation to examine effects 
of the CAG-repeat RNA, and neural granules and P-bodies, 
together with studies of Orb2 will reveal further insight. 

Intriguing recent data raise the possibility that expanded 
repeat RNAs may support protein translation in a manner 
not dependent on an AUG, as with classical protein-coding 
genes. That is, the hairpin repeat structure of triplet-repeat 
RNAs of a variety of sequences can lead to protein translation 
of the repeat region in all three reading frames. This so-called 
repeat-associated non-ATG translation (RAN translation) is 
reported to occur with SCA8 and DM1 CAG expansion tran- 
scripts (46). In the fly, previous studies failed to detect a 
protein generated by the non-coding CAG-repeat RNA (5); 
however, these studies were performed on young animals. 
Similarly, in a mouse model of CAG-repeat RNA toxicity, a 
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polyQ-containing protein was not detected upon expression of 
CAG repeats in the 3'-UTR of the EGFP gene (47). RAN, 
however, may be selective to specific cell subtypes. Our 
finding of accumulation of Hsc70/Hsp70 with age in 
CAG-repeat expressing flies, suggestive of late-onset 
accumulation of misfolded protein, may be indicative of 
such mechanisms, as is the overlap in modifiers of the protein- 
based toxicity and RNA-based toxicity. Our studies provide a 
foundation to address the extent to which this mechanism, as 
well as others, contribution to the toxicity conferred by the 
expanded repeat CAG-RNA, and its integration with protein- 
based toxicity mechanisms. 

MATERIALS AND METHODS 

Drosophila stocks and crosses 

General fly lines were from Bloomington Drosophila Stock 
Center and maintained at 25 °C on standard medium, unless 
otherwise indicated. Transgenic lines UAS-DsRed-CAGO, 
UAS-DsRed-CAGlOO, UAS-DsRed-CAG250, UAS-DsRed- 
CAA/G105 and genetic modifier EP insertion lines have been 
described (5,2 1). Control CAGO flies express a transgene encod- 
ing a DsRed reporter protein with no CAG repeat in the 3'-UTR. 
CAG100 and CAG250 flies express transgenes encoding the 
DsRed reporter with repeats of approximately 100 and 250 
CAGs in the 3'-UTR, as described previously (5). CAA/G105 
flies, a non-toxic trinucleotide control, express a transgene 
encoding DsRed with a repeat sequence of CAA and CAG of 
105 triplet codons in the 3'-UTR, as described (5). 

RNA isolation 

Four independent replicates of flies expressing CAGO, 
CAG100 or CAA/G105 by e/av-GAL4 were raised at 25°C 
with 12 h/12 h light-dark cycle (5). Three-day female flies 
from four independent crosses were collected and brains dis- 
sected. Fly brain tissue (about 20 brains per sample, dissected 
from head capsule, eyes, lamina and outer medulla removed) 
was isolated in cold phosphate-buffered saline and stored in 
Trizol Reagent (Invitrogen, Carlsbad, CA) at — 80°C. Total 
brain RNA was extracted and purified using Trizol Reagent 
(Invitrogen) and the RNeasy Mini system (Qiagen, Valencia, 
CA) and treated with RNase-free DNase I (Qiagen). cDNA 
amplification (Ovation Pico WTA systems, NuGEN, San 
Carlos, CA), labelling and hybridization (Encore Biotin 
Module, NuGEN) were carried out by the microarray core fa- 
cility at University of Pennsylvania. 

Microarray analysis 

Affymetrix Drosophila Genome 2.0 microarrays (Affymetrix) 
were used for the microarray analysis. Normalization of 
probe signal intensity levels across arrays was performed 
using the GC robust multichip average algorithm implemented 
in Partek Genomic suite 6.2 beta. Principle component analysis 
and Pearson's correlation coefficient were used to reveal out- 
liner samples. Three-way analysis of variance (ANOVA) con- 
firmed that the source of variation was not due to different 
batches of RNA. To generate lists of genes differentially 



expressed in different conditions, CAG 100 and CAG250 
were compared with CAGO at the corresponding age (step-up 
P < 0.05). We used one-way ANOVA implemented in Partek 
Genomic suite 6.2 beta to generate a list of differentially 
expressed genes under each condition. Unannotated probe 
sets were discarded. When multiple probe sets were assigned 
to the same gene, the probe set with the smallest step-up 
P-value was included. Gene lists were restricted to genes that 
have at least a 2-fold change and step-up P-value less than 
0.05. GEO accession number for the data is GSE31875. 

Quantitative RT-PCR 

qRT-PCR was performed on 3 -day adult fly brain RNA. Brain 
dissections and total brain RNA were prepared and extracted, 
as described earlier. cDNA was synthesized using the High Cap- 
acity cDNA reverse transcription kit (Applied Biosystems, 
Foster City, CA, USA). qRT-PCR was performed using 
SYBR Green master mix (Applied Biosystems). All RNA 
samples were analysed in triplicate or quadruplicate using a 
7500 Fast Real-Time PCR System (Applied Biosystems). 
Data were analysed using the relative quantification method. 
The endogenous control was ribosomal protein 49 (Rp49). 
Primers identified using Primer Express 2.0 (Applied Biosys- 
tems) were synthesized by Integrated DNA Technologies, Inc. 
Primer sets are as follows: 

rp49 forward: 5' -CAACATCGGTTACGGATCGA-3' , rp49 
reverse: 5'-AATCCGGTGGGCAGCAT-3'; Roel forward: 5'- 
CCTAGTTTCT AAGAGATCTGTGACTG-3 ' , Roel reverse: 
5'-GCCACGCGTATGGTACAAAT-3'; or63a forward: 5'-TG 
GAAC AAGCC ACATCTGAG-3' , or63a reverse: 5'-TGTTAA 
CCTCGGTTGC AAAGT-3 ' ; NFAT forward: 5'-ACACAATC 
CACGCCCAAG-3', NFAT reverse: 5'-GAGCCACTCGCA 
TTGGAC-3'; MTK forward: 5'-CCACCGAGCTAAGATGCA 
A-3', MTK reverse: 5'-TCTGCCAGCACTGATGTAGC-3'; 
CTCF forward: 5'-GCTACACGTTCCGTGGTCA-3', CTCF 
reverse: 5'-CGCTTTCGGTGTCGATTT-3' ; CG9935 forward: 
5'-GCAGACACTGGAACAATCAC A-3' , CG9935 reverse: 
5'-GGACCTGTCAGACCTTTCCA-3'; CG6656 forward: 5'- 
CGACCGATTGCAAAACATAA-3', CG6656 reverse: 5'-C 
AACCCAGAACGACACACAC-3' ; CG30104 forward: 5'-CA 
TGGATGTGGAGTTCAACG-3', CG30104 reverse: 5'-GA 
TGCCCTGTGCCTTCAG-3' ; CG18343 forward: 5'-ACGCA 
AT ACTTTTCGGCTTC-3 ' , CG18343 reverse: 5'-AGGGCGA 
TAACCACTTTGAG-3' ; CG12428 forward: 5'-CGCAA 
GTTCAACGACATTTTC-3' , CG12428 reverse: 5'-TTCATA 
GCTTGGATTCCTTG-3' ; CG11893 forward: 5'-CTGGAA 
TGGACAACTTTTTGAA-3', CG11893 reverse: 5'-TTCTC 
AAAGTGTGGCTTGTACTTT-3' ; 312 forward: 5'-TGGA 
TCACTATTACGGCCTCA-3' , 312 reverse: 5'-GTCGTCCA 
CATTCTCCTTGC-3'; DsRed forward: 5'-CCTCCTCCGA 
GAACGTCATC-3', DsRed reverse: 5'-CCCTCCATGCGCA 
CCTT-3'. 

Histochemistry and image quantification 

Cryosections were performed as described (48). For immunos- 
taining, the primary antibody was anti-Hsc70/Hsp70 
(SPA-822, 1:200, Stressgen-Enzo Life Sciences, Farmingdale, 
NY). The secondary antibody was anti-mouse conjugated to 
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Alexa Fluor 488 (1:100, Molecular Probes, Eugene, OR). The 
number of Hsc70/Hsp70-positive particles was quantitated 
using the particle analysis method implemented in Image J 
(NIH). The thresholds were adjusted based on background 
signals in the central brain region. Images from three to four 
animals per genotype were analysed. 

Climbing assay 

Flies expressing the repeat gene by da-GAL4 were raised at 
20°C to prevent lethality at pre-adult stages and shifted to 
29°C after 1 day for a stronger effect and for time considerations. 
About 8-20 female flies were anaesthetized by CO2 and placed 
in a plastic vial. After a 1 h recovery period, flies were banged to 
the bottom of the vial, and then the number of flies that climbed 
higher than a 2 cm threshold in 10 s was scored. Three trials 
were performed at 5 min intervals for each cohort of animals. 
At least, 100 flies per genotype were examined in each repeat. 

SUPPLEMENTARY MATERIAL 

Supplementary Material is available at HMG online. 
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